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Abstract 

^ We show that the ratio of matched individuals to blocking pairs 

Q grows linearly with the number of propose-accept rounds executed by 
the Gale-Shapley algorithm for the stable marriage problem. Conse- 
^\ quently, the participants can arrive at an almost stable matching even 

04 without full information about the problem instance; for each partici- 

pant, knowing only its local neighbourhood is enough. In distributed- 
X^/y systems parlance, this means that if each person has only a constant 

number of acceptable partners, an almost stable matching emerges af- 
ter a constant number of synchronous communication rounds. This 
Q holds even if ties are present in the preference lists. 

' We apply our results to give a distributed (2 + e)-approximation 

I algorithm for maximum-weight matching in bicoloured graphs and a 

^ centralised randomised constant-time approximation scheme for esti- 

mating the size of a stable matching. 

00 

1 Introduction 

(N 

The social networking boom brings up computational challenges unforeseen 
in the past. In a modern large-scale network, gathering full information 
about the whole network in one place is practically impossible. This moti- 
vates design and analysis of local algorithms |23(|27„32j in which the output 
of a node depends only on the input within a constant number of edges 
(hops) from the node. 

In this paper we study a local version of the classical Gale-Shapley algo- 
rithm |8j for the stable marriage problem. We show that early termination 
of the algorithm leads to a matching with relatively few unstable edges. 
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1.1 Matchings 

We follow the convention (see, e.g., Fleiner [^) that an instance of the 
stable marriage problem is specified by a simple undirected bipartite graph 
S = (i? U B,E); the graph may be disconnected but there are no isolated 
nodes. Call the nodes in R red, the nodes in B blue, and the graph 9 a 
hicoloured graph. Each node v £ RU B has a linear order on the adjacent 
nodes. The linear order constitutes the matching preference for v; if a node 
u is not adjacent to v, then neither u is an acceptable partner for v nor v is 
acceptable for u. 

A set of edges M <^ E is a matching if every node is incident to at most 
one edge in M. Two nodes joined by an edge in M are matched in M; a node 
not incident to an edge in M is unmatched in M. A matching is maximal 
if it is not a subset of a larger matching. A matching is maximum if it has 
the maximum size among all the matchings. In an edge-weighted graph, a 
maximum-weight matching is the one that has the maximum weight among 
all the matchings. A greedy matching is a maximal matching obtained by 
adding the edges one by one, in order of decreasing weight. It is well-known 
that the greedy matching is a 2-approximation to the maximum-weight one. 

Let M be a matching. An edge {u, v} €z E \ M is unstable relative to 
M if both (i) u is unmatched or prefers v over its current match in M; and 
(ii) V is unmatched or prefers u over its current match in M. The matching 
M is stable if there are no unstable edges. Let e > be some constant. This 
work is centred on the following notion of "almost stability" . 

Definition 1. A matching M is e-stable if the number of unstable edges is 
at most e\M\. 

Denote by A the maximum degree of a node of S. That is, A is the 
maximum number of acceptable partners for one participant. In a social 
network, the bound on the degree of a node v may not necessarily be due to v 
being particularly picky; the bound may just reflect the fact that the number 
of participants, information about whom is available (or comprehensible, or 
relevant) to v, is limited. Throughout this work we assume that A is a 
constant. 

1.2 Model of Distributed Computing 

We view S as the communication graph of a distributed system: if {u, i"} G E, 
then the node u and the node v can exchange messages. 

Let T be a constant that determines the trade-off between the stability 
and the running time. Initially, each node v € RUB knows the following in- 
formation: its colour (i.e., whether v G R ot v £ B), its degree d{v), and the 
constant T. The node v has d{v) ports through which it can communicate 
with its neighbours; the ports are numbered according to w's preferences. 
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We assume synchronous communication. On each time step, every node 
can (i) receive messages from its neighbours, (ii) perform deterministic lo- 
cal computations, and (iii) send a message to each of its neighbours. The 
algorithm runs for T synchronous time steps, after which each node needs 
to produce the output: which of the neighbours, if any, is its partner in the 
matching. We also require that the output is consistent: if u announces that 
V is its partner, then v must announce that u is its partner. 

1.3 Statement of Results 

Our main contribution lies in an analysis of the "transient phase" of the 
Gale-Shapley algorithm for the stable marriage problem. We truncate the 
algorithm to a specified number of proposal rounds and investigate the re- 
sulting matching for stability. We find that a constant number of rounds 
suffice to obtain a constant ratio of unstable edges to matching edges. We 
obtain the following result. 

Theorem 1. There exists a deterministic distributed algorithm that finds 
an e-stable matching in a bicoloured graph in time T < 4-\- 2A^/e. 

The algorithm works even in the case when ties in the preference lists 
are allowed. 

We also consider the case where S is an edge-weighted graph, and the 
preferences are determined by the edge weights: each node prefers its inci- 
dent edges in order of their weight, with the heaviest edge being the most 
preferred. If all weights are distinct, the unique stable matching in S is the 
greedy one, and its weight is at least 1/2 of that of the maximum- weight 
matching. In general, an almost stable matching can be a poor approxima- 
tion to the maximum-weight one; say, a super-heavy edge may be missing 
from the almost stable matching. Fortunately, the particular almost stable 
matching from Theorem [T] is much better in this sense. 

Theorem 2. There exists a deterministic distributed algorithm that finds 
a {2 + e)- approximation for maximum- weight matching in bicoloured graphs 
in time T <4 + 2A/e. 

Given access to a bicoloured graph via a preference oracle (queried with 
a node, the oracle replies with the colour of the node and the adjacent nodes 
listed in order of preference), the algorithm in Theorem [T] enables estimation 
of the size of a stable matching using a constant number of queries to the 
oracle. We analyse the non-degenerate case A > 3. 

Theorem 3. For any < 5 < 1/2, < e < 1, and A > 3, there exists a ran- 
domised algorithm that, given access to a preference oracle for a bicoloured 
graph S without isolated nodes, makes at most 25000e~^(A — i)3+4A/e ^-i 
queries to the oracle and outputs with probability at least 1 — 6 an estimate 
rh such that \m — \M\ \ < e\M\ where M is a stable matching in S- 
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Here it should be stressed that the algorithm in Theorem |3] estimates 
the size of a stable matching, not just an e-stable matching. 



1.4 Overview 

The rest of the paper is structured as follows. In Section |2] we discuss 
the specific choices that we make in this work, in particular, why and how 
an almost stable matching differs from a stable matching in a distributed 
setting. Section [3] reviews related work. Sections|4]and|5]present and analyse 
the distributed, truncated version of the Gale-Shapley algorithm. Section |6] 
applies this algorithm to prove Theorems [l}|3| 



2 Motivation and Implications 

2.1 Local Algorithms 

The running time T of the algorithms presented in this work only depends on 
the degree bound A and the desired stability or approximation guarantee e. 
For a constant A and e, these are constant-time algorithms; the running 
time is independent of the number of nodes in the network, the diameter 
of the network, or other global properties. Such constant-time distributed 



algorithms are known as local algorithms [23,27,32 . 

Besides the practical advantage of a low time complexity, our positive 
results also provide insight into the structure of the stable marriage problem. 
We mention three interpretations of Theorem [T] from different perspectives; 
these all follow from the observation that in T synchronous communication 
steps, information can only be propagated from distance T in the network. 

• Robustness and dynamic graphs. The e-stable matching M from The- 
orem [T] is robust in a dynamic network. A change in the network only 
affects M in the radius-T neighbourhood around the point of change. 

• Available information. Information within the radius-T neighbour- 
hood of an edge e is sufficient to determine whether e is part of the 
globally consistent e-stable matching M. 

• Parallelism and circuit complexity. For any graph, we can construct 
a bounded-fan-in Boolean circuit that maps the matching preferences 
to an e-stable matching. The depth of the circuit only depends on the 
constants A and e, not on the size of the graph. 

As we will discuss in the next section, these properties highlight a substantial 
difference between stable and almost stable matchings. 
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2.2 Almost Stable vs. Stable 



While an almost stable matching is robust to change in the preferences, this 
is not the case for a stable matching. For example, consider the following 
graph where the numbered edge ends indicate preference rankings (the most 
preferred match has rank 1). 



Now transpose the preferences shown in boldface to obtain a graph whose 
unique stable matching is edge-disjoint from the unique stable matching in 
the original graph. 



Thus, a single transposition in preferences can force every match to break up. 

From the perspective of distributed computation, this example shows 
that every node must have essentially complete information about the net- 
work to arrive at a stable matching. In particular, if each node initially 
knows only its neighbours and its own preferences, the number of commu- 
nication steps required between neighbours is linear in the diameter of the 
network. 

Gathering global information is hardly practical in a very large-scale dis- 
tributed system, especially if the network undergoes frequent changes. The 
ease with which almost stable matchings can be computed in a distributed 
setting comes as a positive surprise. 

2.3 Definition of Almost Stable 

Our results are somewhat oblivious to the exact definition of an "almost 
stable" matching. The number of unstable edges (blocking pairs) appears 
to be generally accepted as a basic measure of instability [l][3||5][T9] , but one 
could equally well consider other measures. For example, one can consider 
the number of nodes that are endpoints of unstable edges. 

Naturally we must measure stability in relation to some other quantity; 
an absolute guarantee of stability cannot be achieved in constant time. We 
have chosen to measure the number of unstable edges relative to |M|, as 
this gives us the strongest results when compared with other quantities such 
as \R\, \B\, \R[J B\, \E\, and each of which is at least as large as 

\M\. Eriksson and Haggstrom |5 provide arguments in favour of comparing 
unstable edges to \E\. 
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2.4 Marriages vs. Roommates 



An immediate question is whether our results apply to the stable roommates 
problem - the non-bipartite version of the stable marriage problem. It is 
easy to see that a non-bipartite graph need not have an e-stable matching for 
e < 1: consider a triangle with "rotational-symmetric" preferences - there 
is one unstable edge to every matching edge. 



2.5 Bicoloured vs. Bipartite 

A graph is 2-colourable if and only if it is bipartite. Given a global view 
of a bipartite graph, it is trivial to 2-colour the graph. In the context of 
distributed algorithms this relationship is more subtle, however. Linial [23] 
shows that there is no constant-time algorithm for finding a maximal match- 
ing in a cycle with 2n with vertices. Czygrinow et al. ^ show that finding 
a constant-factor approximation to maximum-cardinality matching with a 
deterministic constant-time algorithm is not possible either. However, these 
negative results heavily rely on the fact that the nodes in the bipartite 
graph do not know their colours in a 2-colouring. Theorem [2] shows that 
knowledge of the colour is not only necessary but also sufficient to find a 
constant-factor approximation of maximum-weight matching with a deter- 
ministic distributed constant-time algorithm. 

The approximation factor of 2-|-e in Theorem[2]is almost the best possible 
for any algorithm (distributed or centralised) that has access only to the 
relative order of the weights, and not their numerical values. To see that no 
algorithm can have the approximation factor 2 — e for any e > 0, consider 
the following graph with the weight a > 1. 



If a ^ 1 , an algorithm must include the edge with weight a into the output 
(otherwise the algorithm has no approximation guarantee at all). But then 
the algorithm, having access only to the relative order of the weights, must 
include this edge also when 1 < a < 2/(2 — e), which contradicts with the 
approximation guarantee 2 — e. 
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3 Related Work 



The stable marriage problem remains a subject of active research. The book 
by Gusfield and Irving [9] is a comprehensive survey on the problem; see the 
MATCH-UP workshop [lO] for the latest developments. 



3.1 Almost Stable Matching 

Abraham et al. [IJ study almost stable matchings in the stable roommates 
problem. The recent work by Biro et al. [3j is particularly close to ours: 
they, too, consider the stable marriage problem with incomplete preference 
lists, and aim at finding a matching with few unstable edges. However, in 
terms of computational complexity, their work goes in the opposite direc- 
tion. Their task is to find a maximum matching that minimises the number 
of unstable edges. It turns out that this makes the problem computation- 
ally much more difficult: the problem is NP-hard, unlike the classical stable 
marriage problem. In contrast, we do not require that the matching is a 
maximum matching, which makes the problem computationally easier: the 
problem admits a constant-time distributed algorithm, unlike the classical 
stable marriage problem. The algorithm works even when ties in the pref- 
erences lists are allowed; this should be contrasted with the fact that if ties 
are allowed, it is NP-hard to find a stable perfect matching (IT 16 17 . 



3.2 Distributed Stable Matching 

Everyone witnesses evolution of matchings in real-world and virtual-world 
social networks; in its simplest form, the matchings attempt to attain stabil- 
ity by switching along unstable edges. In a seminal work, Knuth |20 showed 
that such a switching may go in cycles, never resulting in a stable matching. 
Roth and Vande Vate [3l] showed that switching randomly almost surely 
leads to stability. 

Note that switching partners along unstable edges can be done in a dis- 
tributed manner. The Gale-Shapley algorithm is also parallel by its nature: 
the proposals/rejects can be undertaken by all men/women simultaneously 
during synchronised rounds (albeit it can happen that only one man is free 
at a round J9| Section A. 3], [34]). Lower bounds on the running time of the 
algorithm (Oj Section 1.5] show that a linear number of rounds is required 
to attain stability. But can a nearly stable matching be obtained with fewer 
rounds? 

Several works have addressed the last question with experiments. Quinn 
|30| observes experimentally that a matching with only a fraction of unstable 
edges emerges long before the Gale-Shapley algorithm converges. Lu and 
Zheng |24| propose a parallel algorithm that outperforms the Gale-Shapley 
algorithm in practice. Theorem [T] gives theoretical support to the findings in 
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Quinn [30| . Theorem [T] al so addresses the concern expressed in the conclu- 
sions of Lu and Zheng [24] where it is claimed that "Most of existing parallel 
stable matching algorithms cannot guarantee a matching with a small num- 
ber of unstable pairs within a given time interval." Theorem [T] suggests 
that if the number of acceptable partners for each participant is bounded, 
the Gale-Shapley algorithm guarantees a small relative number of unstable 
edges. 

From a theory perspective, apparently only a few papers address de- 
centralised implementations of the Gale-Shapley algorithm and/or stability 
after early termination of a stable matching algorithm. In a recent paper [s] 
it is claimed that "little theory exists concerning instability." KhuUer et 
al. [19] give bounds on the performance of a simple online algorithm. Feder 
et al. |6] show that a stable matching can be found on a polynomial number 
of processors in sublinear time; their algorithm is not local. Other work on 



parallel stable matching include Tseng and Lee |34| , Tseng |33|, and Hull 15 
Eriksson and Haggstrom j:5 prove that a simple heuristic works well for ran- 
dom inputs. Our Theorem [T] shows that the Gale-Shapley algorithm works 
well for an arbitrary input. 

3.3 Local Algorithms and Matchings 



As we mentioned in Section |2.5[ there is a range of negative results related to 
local algorithms (constant-time distributed algorithms) for maximal match- 
ing [23] and approximate maximum matching (4)[2l][22]|26]. Even if each 
node is assigned a unique identifier and the network topology is an n-cycle, 
it is not possible to break the symmetry in the network and find a constant- 
factor approximation for maximum matching. Without any auxiliary infor- 
mation beyond unique node identifiers, positive results are known only in 
rare special cases, most notably for graphs where each node has an odd 



degree 25 27 



Hence some auxiliary information is needed. For example, in problems 
related to stable marriage, it is natural to assume that every participant 
knows his/her gender. This assumption has not been exploited much in 
prior work on distributed, deterministic constant-time algorithms. We are 
only aware of Hahckowiak et al. [12^, which shows that there is a constant- 
time algorithm for maximal matching in bi coloured bounded-degree graphs. 
Similarly to our algorithm, their algorithm does not need unique node iden- 
tifiers; port numbering [2^ is sufficient. 

Other work on constant-time distributed algorithms for matching usually 
assumes either randomness [14|21p2|28p5 or geometric information [13p6 



We refer to the survey 132 for further information on local algorithms. 
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3.4 Centralised Constant-Time Algorithms and Matchings 

Our centralised constant-time approximation algorithm in Theorem |3] is 



based on the ideas of Parnas and Ron 29 and Nguyen and Onak [28]. 
Their work presents constant-time approximation algorithms for estimat- 
ing the size of a maximal matching, maximum-cardinality matching, and 
maximum- weight matching. Our work complements this line of research by 
presenting an algorithm for estimating the size of a stable matching. 



4 Algorithm 

We work with the following distributed variant of the Gale-Shapley algo- 
rithm. Each blue node b & B maintains the variable p{b) which is the match 
of b in the current matching or _L (6 is unmatched). Each red node r £ R 
maintains the following variables: C(r) is the list of candidates for matching; 
c(r) is the candidate from whom r is waiting for a response or _L (r has no 
current candidate); and p{r) is the match of r in the current matching or _L 
(r is unmatched). 

The algorithm executes in rounds. Each round consists of two turns, a 
blue turn followed by a red turn. Each node v G RD B is active during turns 
of its colour. 



Blue turn. Each blue node b & B completes the following read-compute- 
write cycle. Initially, p{b) = _L. 

1. Receive all incoming messages; let P be the subset of neighbours that 
sent the message 'propose'. If P is empty, do nothing during this turn. 

2. If p{b) / _L, then set Q ^ P U {p{b)}; otherwise set Q ^ P. 

3. Let q be the node in Q most preferred by b. 

4. If g/p(6): 

(a) If p(6) 7^ _L then send the message 'break' to p{b). 

(b) Send the message 'accept' to q. 

(c) Set p{b) ^ q. 

5. For each r £ P \ {q}: 

(a) Send the message 'reject' to r. 
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Red turn. Each red node r E R completes the following read-compute- 
write cycle. Initially, C(r) consists of all adjacent blue nodes, in order of 
decreasing preference, most preferred first; c(r) = _L; p{r) = _L. 



1. If c(r) ^ ±: 



(a) Receive a message m from c(r). 

(b) If m = 'accept' then p{r) c(r). 

(c) If m = 'reject' then remove c(r) from C{r). 

(d) c(r) ^ ±. 



2. If p{r) + ±: 



(a) Receive a message m from p(r), if any. 

(b) If m = 'break' then remove p(r) from C(r), and set pir) <— _L. 
3. If p(r) = _L and C(r) is not empty: 

(a) Let c(r) be the first element of C(r). 

(b) Send the message 'propose' to c(r). 

5 Analysis 

We adopt the convention of using a subscript z = 1, 2, ... to denote the state 
of the algorithm at the end of round i. For example, Pi{r) denotes the value 

of the variable p(r) in the local state of r at the end of round i. 

At the end of round i = 1,2,..., the local state variables pi{-) define a 
matching 



5.1 Lost edges and convergence to stability 

We start from a restatement, in our terms, of the fact that as the Gale- 
Shapley algorithm progresses, women only improve their match and men 
only get worse. 

Lemma 1. An edge {r, b} e E\Mi with r e R and b e B is unstable in Mi 
only if b is present in Ci{r) and r is unmatched at the end of round i. 

Proof. If b is not in Ci{r), then either b rejected r or broke up with r earlier, 

which means that b prefers its current match to r. If b is in Cj(r) and r is 
matched, then r has not proposed to b yet, which means that r prefers its 
current match to b. □ 



Mi = {{r,pi{r)} 

= {{b,Pim 



r e R, pi{r) / _L} 
b G B, pi{b) / X} C 
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During each round, each red node r £ R removes at most one blue node 
b £ B from C(r). We say that such an edge {r,b} is a lost edge because 
it can no longer occur in the matching. We write Li <^ E for the set of all 
edges lost by the end of round i. Note that Li = because only 'propose' 
messages are sent during the first round. 

Because E is a finite set and Li^i Li <^ E, there exists a z € {2, 3, . . .} 
such that Lz = L^-i- Since received 'reject' and 'break' messages increase 
the number of lost edges, it follows that no such messages were received 
during round z. In particular, any unmatched r £ R at the end of round 
z must have Cz-i{r) empty (and hence Cz(r) empty), because otherwise 
it would have received a 'reject' or 'break' during round z. It follows by 
Lemma [T] that is a stable matching. Let us denote this stable matching 
by Moo in what follows. 

5.2 Weight and potential 

Associate with each edge e £ E a positive integer weight 'w{e) such that 
the weights respect the preferences of each node; that is, whenever a node v 
prefers x over y, it holds that w{{v, x}) > w{{v, y}). This is always possible, 
e.g., by choosing w = 1. In fact, to prove Theorem [T] taking u; = 1 is 
sufficient, while the proof (and actually the statement) of Theorem |2] deals 
with an arbitrary w. 

Associate with each blue node b £ B the weight Wi{b) = w{{b,pi{b)}) if b 
is matched; otherwise 'Wi{b) = 0. The total blue weight, Wi{B) = YlbeB ^«(^)) 
is equal to the total weight of the matching Mj. 

Associate with each red node r £ R a potential fi{r) as follows. If r is 
matched or Ci{r) is empty, set fi{r) = 0. Otherwise, set /i(r) = u;({r, 6}) 
where b is the first element of Cj(r). 

Intuitively, the potential /j(r) is an upper bound for the extra weight 
that r could have if we ran the algorithm further than i rounds. In a stable 
matching, fi{r) = for all r £ R, but we do not necessarily achieve this 
by running the algorithm for a constant number of rounds. However, we 
can derive an upper bound for the total potential fi{R) = J^renfii''") 
observing that lost edges are heavier than those along which proposals (will) 
go. 

Lemma 2. For all i = 2,3, .. . it holds that fi{R) < w{Li) — ?x;(Lj_i). 

Proof. Each r £ R that has a positive potential at the end of round i > 2 
has either received a 'reject' or 'break' during the round i; in both cases r 
has removed a unique blue node b from C(r). The weight of the lost edge 
{r, 6} is at least the potential fi{r) because C(r) is ordered by decreasing 
preference. The claim follows by taking the sum over all red nodes and edges 
lost during round i. □ 
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Lemma 3. For all i = 2,3, .. . it holds that fi{R) < fi-i{R). 

Proof. Consider the red turn at round i. The potential of a red node r €z R 
changes only if it receives a message. First, if r receives a 'reject' message 
during round i, the potential of r at end of round i may change but it does not 
increase, that is, fi{r) < /j_i(r). Second, if r receives a 'break' message from 
a blue node b, there is a unique red node q that receives an 'accept' message 
from b. In this case the potential of q decreases from fi~i{q) = 'w{{q,b}) 
to fi{q) = while the potential of r increases from /j_i(r) = to < 
w{{r,b}) < w{{q,b}) = Third, if r receives an isolated 'accept' 

message (without an associated 'break' message), fi{r) = < /j_i(r). The 
claim follows by taking the sum over all 'reject' messages, all 'break'-'accept' 
matches, and all isolated 'accept' messages during round i. □ 

Lemma 4. For all i = 2,3, .. . it holds that w{Li) > {i — l)fi{R). 
Proof. By Lemma |2] and Lemma [3j we have 

i i 

w{Li) = w{U) - w{Li) = J2 - w{Lj-i) > ^ fj{R) > {i - l)f^{R). 

j=2 j=2 

□ 

Lemma 5. For all i = 2,3, .. . it holds that w{Li) < (A — l)wi{B) 

Proof. A blue node b £ B can lose an incident edge {r, 6} during round 
j = 2,3, . . . ,i only if b is matched at the end of round j with a node that b 
prefers to r. Put otherwise, if {r, 6} G Lj \ Lj-i, then 'w{{r,b}) < Wj{b) < 
Wi{b). The last inequality follows because each b £ B, once matched, only 
changes to a more preferred match. Furthermore, a blue node can lose at 
most A — 1 incident edges. □ 

Lemma 6. For all'y > and i > H-(A — 1)/7 it holds that fi{R) < jWi{B). 
Proof. By Lemma |4] and Lemma [5} □ 



6 Applications 

6.1 Proof of Theorem [l] 

For the purpose of analysis, set w^e) = 1 for all e £ E. Then |Mj| = Wi{B), 
so Lemma [g] and i > 1 + (A — l)/7 imply fi{R) < 7|-/Vfj|. Denote by Ui the 
number of unstable edges in E \ Mi. Because fi{R) counts the number of 
unmatched red nodes with a nonempty Cj (r) , it follows from Lemma [T] that 
Ui < Afi{R). Thus, 7 = e/A gives m < e\Mi\ for i > 1 + A(A - l)/e. 

We can choose an integer i < 2 + A^/e. We can compute Mi in i rounds 
or r = 2i < 4 + 2A^/e synchronous communication steps. □ 

The proof for the case when ties in the preference lists are allowed, is 
analogous. 
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6.2 Proof of Theorem [2] 

Let a positive integer weight w{e) for each edge e £ E he given as input. 
Assign the preferences of each node v so that whenever w{{v, x}) > w{{v, y}) 
for two edges {v,x}, {v,y} £ E, v prefers x to y. Execute the algorithm in 
Section |4] for i rounds to obtain the matching M,. Let M* <Z E he a 
maximum-weight matching. 

For each red node r £ R, let g(r) C B consist of the matches of r in Mi 
and in M*, if any. In particular, \g{r)\ < 2 for all r £ R, and < 2 

for all b£ B. 

Let {r, 6} £ M* with r £ R and b £ B. Exactly one of the following 
holds at the end of round i: 

1. The node r has not received a response from b. If r is matched with 
b2 b, r prefers 62 over b and w{{r,b}) < u;({r, 62}) = Wi{b2)- Other- 
wise r is unmatched, b is in Cj(r), and tt;({r, 6}) < fi{r). 

2. The node r has received a response from b. Then b is matched with r or 
with some other red node that b prefers over r. Thus w{{r, 6}) < Wi{b). 

For every {r, 6} G M* thus 

</.(r)+ w;,(c) 
ceg(r) 

and hence 

U)(M*) = ^ w{{r,b]) 

{r,b}eM* 

= /i(i?) + 2^i;i(S). 

From Lemma |6] we conclude that 

w{M*) < (2 + e)ii;i(S) = (2 + e)u;(Mi) 

whenever i > 1 -|- (A — l)/e. 

We can choose an integer i < 2 + A/e. We can compute Mj in i rounds 
01 T = 2i < 4 + 2 A/e synchronous communication steps. □ 

6.3 Proof of Theorem H 

Let us first recall a convenient Chernoff-type upper bound for the tail prob- 
ability of a binomial random variable Z ~ Bin(A;,p), where A: is a positive 
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integer and < p < 1. For < /? < 1, we have 

Fi {\ Z -pk\> (3pk) < 2 ex.-p(^-^^^y (1) 

(See Janson et al. |18, Corollary 2.3] for a proof.) 

Let < 6 < 1/2, < e < 1, and A > 3 be given. For the purpose 
of analysis, set w^e) = 1 for all e £ E. We estimate the size of the stable 
matching M^o in S using a randomised algorithm that queries the preference 
oracle for S. We assume that the number of nodes, n, is given as input to 
the algorithm. 

Let us first relate \Mj\ to |Moo|. We have \Mj\ < \Moo\ because every 
blue node, once matched, remains matched. Furthermore, |Moo| < |Mj| + 
fj{R) because every red node r £ R with empty Cj{r) will be unmatched in 
Moo. Let 

e 2A - 2 A - 1 

7 = — , 7 > IH = H . (2) 

8A' - e 4A7 ^ ' 

Note that 7 < 1. By Wj{B) = \Mj\ and Lemma [g] we have 

\Mj\ < |Moo| < (1 +4A7)|Mj-| < \Mj\ + | • |Moo|. (3) 

It thus suffices to have an estimate for \Mj\. 

Because 9 has no isolated nodes, \R\ < A|S| and \B\ < A|i?|. Because 
a stable matching is maximal and there are no isolated nodes, 

A|Moo| > \R\, (4) 
A|Moo| > \B\. (5) 

By ^ we have |Moo| <2\Mj\ and thus 

2|<l^.l<l^l- (6) 

Since n = \R\ + we have 

< \R\ < n. (7) 



A + 1 

Let 



iVo= ^ 2 ^"x' A^>A^o. (8) 



6(A + 1)^ 6 
7^ 

We use the following procedure to estimate \Mj\. First, select N nodes 
uniformly at random and query the oracle for their colour. Denote by X 
the number of red nodes among the N nodes. Then, select red nodes 
uniformly at random; that is, select 2(A + 1)A^ nodes uniformly at random 
and select the first N red nodes among those (if not enough red nodes 
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occur, output 'failure' and stop); denote by Z the number of red nodes 
obtained (that is, failure occurs if and only \i Z < N). For each red node, 
we determine whether it is matched in Mj; denote by Y the number of 
selected red nodes that are matched in Mj. Output 

X Y 

and stop. 

To analyse the procedure, let us first derive upper bounds for the prob- 
abilities of certain "bad" events. 

Let us first derive an upper bound for the probability of failure. Let 

p=^. . = 2(A + l)iV. and ^ 

By ([7| we have f3 < 1/2. Furthermore, by ([T]) we have that Z < N implies 
\Z - pk\ > N = f3pk. Thus, from and (|8j) it follows that 



Ft{Z <N)<2 exp 
= 2 exp 



3 

Nn 



6(A + l)|i?| 



/ N \ 6 

Let us now derive an upper bound for the probability that the random 
variable X/N significantly deviates from its expectation \R\/n. Let 

I -Rl 

p= — , k = N, and /? = 7. 
n 

Observe that /3 < 1 by ([2]). By ([TJ, (Q, and we have 
X 



Pr 



> pp^ = Pr(|X - pk\ > ppk) 
< 2 exp 
= 2 exp ( 



(Tpk 
3 

P'^\R\N 

\ 3n 

( (3^N \ 5 
< 2exp f-^ ^ < -. 

Finally, let us now derive a similar bound for the probability that the 
random variable Y/N significantly deviates from its expectation |Mj|/|i?|. 
Let 

;,= i^, k = N, and /3 = 7. 
\R\ 
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Observe that /3 < 1 by ([2|. By Q, (|6|, and Q, we have 
Y 



Pr 



N 



> pp^ =Fi{\Y -pk\ > Ppk) 



< 2 exp 



2exp^- 



< 2 exp 



(3'^\Mj\N 

m 

_0^N_ 
'6(A + 1) 



< 



Next we show that, with probabihty at least 1 — 6, the estimate rh 
approximates |Moo| with the claimed accuracy. Observe first that \ac—hd\ < 
a\c — d\+ d\a — h\ for a, b,c,d > 0. Thus, with probability at least 1 — 6, both 
no failure occurs and 



m - \Mi 



n 



< n 



< n 



X Y 
N ' N 
X 

N 
X 
N 



\R\ \Mi 



n 



\R\ 



Y 


\Mj\ 


+ 


Mj\ 




X 




N 


\R\ 


\R\ 




N ~ 


n 




Mj\ . 1 


Mj 








7- 


\R\ 


\R\ 


- • 7 • 


B) 

n J 





(9) 



< 2-fn = 2'y{\R\ + \B\) < 4:Aj\Mo, 



where the last inequality follows by Q and ([5|. Thus, with probability at 
least 1 — S, we have 



< \m 



|Moo|| 

= e|Moo| 



where the last inequality follows by ^ and 

It remains to derive an upper bound for the number of oracle queries 
we make to compute the estimate. First, we make N queries to determine 
X. Second, we make 2(A + 1)A^ queries to obtain N red nodes (or declare 
failure). Third, for each of the red nodes, we execute the algorithm in 
Section |4] for j rounds to determine whether the node is matched in Mj. 
Each round of the algorithm propagates information (messages) for at most 
two hops in S, which implies that we can decide whether any given r £ R 
is matched in Mj if we know the preferences of all nodes within distance 2j 
from r in 9- There are at most 



2i-i 



1 + A^(A- 1)^ = 1 + 



A 



i=0 



A 



((A - 1)2^ - 1) < 3(A- 1 



i2i 
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such nodes. Because A > 3 and 6 < 1/2, we have 



No = 384A2(A + I)e-2(ln6 + ln5-i) 




— 1 



Therefore we can select integers j and N such that j < 2Ae and A'^ < 
6250(A — l)^e~^ lnS~^. In total we make at most 



Acknowledgements 

This work was supported in part by the Academy of Finland, Grants 116547, 
117499, and 118653 (ALGODAN), by Helsinki Graduate School in Computer 
Science and Engineering (Hecse), and by the Foundation of Nokia Corpora- 
tion. 

We acknowledge email exchange with the members of the Algorithms 
and Complexity Research at Glasgow group, and thank the organisers of 
ALGO 2008 for the creative atmosphere at the conference. 

References 

[1] David J. Abraham, Peter Biro, and David F. Manlove. "Almost stable" match- 
ings in the roommates problem. In Proc. 3rd Workshop on Approximation and 
Online Algorithms (WAOA, Palma de Mallorca, Spain, October 2005), volume 
3879 of Lecture Notes in Computer Science, pages 1-14, Berlin, Germany, 2006. 

Springer- Vcrlag. 

[2] Dana Angluin. Local and global properties in networks of processors. In Proc. 
12th Annual ACM Symposium on Theory of Computing (STOC, Los Angeles, 
CA, USA, April 1980), pages 82-93, New York, NY, USA, 1980. ACM Press. 

[3] Peter Biro, David F. Manlove, and Shubham Mittal. Size versus stability in 
the marriage problem. In Proc. 6th Workshop on Approximation and Online 
Algorithms (WAOA, Karlsruhe, Germany, September 2008), Berlin, Germany, 
2008. Springer- Verlag. To appear. 

[4] Andrzej Czygrinow, Michal Hanckowiak, and Wojcicch Wawrzyniak. Fast dis- 
tributed approximations in planar graphs. In Proc. 22nd International Sym- 
posium on Distributed Computing (DISC, Arcachon, Prance, September 2008), 
volume 5218 of Lecture Notes in Computer Science, pages 78-92, Berlin, Ger- 
many, 2008. Springer- Verlag. 



N + 2{/^ + l)N + 3(A - if^N < 4(A - if^N 

< 25000e-2(A- l)3+4^/^ln(5 



— 1 



queries. This completes the proof. 



□ 



17 



[5] Kimmo Eriksson and Olle Haggstrom. Instability of matchings in decentralized 
markets with various preference structures. International Journal of Game 
Theory, 36(3-4) :409-420, 2008. 

[6] Tomas Feder, Nimrod Megiddo, and Serge A. Plotkin. A sublinear parallel 
algorithm for stable matching. Theoretical Computer Science, 233(1-2) :297- 
308, 2000. 

[7] Tamas Fleiner. A fixed-point approach to stable matchings and some applica- 
tions. Mathematics of Operations Research, 28(1):103-126, 2003. 

[8] David Gale and Lloyd S. Shapley. College admissions and the stability of 
marriage. The American Mathematical Monthly, 69(1):9-15, 1962. 

[9] Dan Gusfield and Robert W. Irving. The Stable Marriage Problem: Structure 
and Algorithms. The MIT Press, Cambridge, MA, USA, 1989. 

[10] Magnus M. Halldorsson, Robert W. Irving, Kazuo Iwama, and David F. 
Manlove, editors. MATCH-UP: Matching Under Preferences - Algorithms 
and Complexity. Satellite workshop of ICALP 2008, July 2008. 

[11] Magnus M. Halldorsson, Robert W. Irving, Kazuo Iwama, David F. Manlove, 
Shuichi Miyazaki, Yasufumi Morita, and Sandy Scott. Approximability results 
for stable marriage problems with ties. Theoretical Computer Science, 306(1- 
3):431-447, 2003. 

[12] Michal Hahckowiak, Michal Karohski, and Alessandro Panconesi. On the dis- 
tributed complexity of computing maximal matchings. In Proc. 9th Annual 
ACM-SIAM Symposium on Discrete Algorithms (SODA, San Francisco, CA, 
USA, January 1998), pages 219-225, Philadelphia, PA, USA, 1998. Society for 
Industrial and Applied Mathematics. 

[13] Marja Hassinen, Evangelos Kranakis, Valentin Polishchuk, Jukka Suomela, and 
Andreas Wiese. Analysing local algorithms in location-aware quasi unit-disk 
graphs, 2008. Manuscript. 

[14] Jaap-Henk Hoepman, Shay Kutten, and Zvi Lotker. Efiicient distributed 
weighted matchings on trees. In Proc. 13th International Colloquium on Struc- 
tural Inform,ation and Communication Complexity (SIROCCO, Chester, UK, 
July 2006), volume 4056 of Lecture Notes in Computer Science, pages 115-129, 
Berlin, Germany, 2006. Springer-Verlag. 

[15] M. Elizabeth C. Hull. A parallel view of stable marriages. Information Pro- 
cessing Letters, 18(2):63-66, 1984. 

[16] Robert W. Irving, David F. Manlove, and Crcgg O'Mallcy. Stable marriage 
with ties and bounded length preference lists. Journal of Discrete Algorithms, 
2008. To appear. 

[17] Kazuo Iwama, David F. Manlove, Shuichi Miyazaki, and Yasufumi Morita. 
Stable marriage with incomplete lists and ties. In Proc. 26th International Col- 
loquium on Automata, Languages and Programming (ICALP, Prague, Czech 
Republic, July 1999), volume 1644 of Lecture Notes in Computer Science, pages 
443-452, Berlin, Germany, 1999. Springer-Verlag. 

[18] Svantc Janson, Tomasz Luczak, and Andrzcj Rucinski. Random Garphs. John 
Wiley & Sons, Inc., New York, NY, USA, 2000. 



18 



[19] Samir KhuUer, Stephen G. Mitchell, and Vijay V. Vazirani. On-line algorithms 
for weighted bipartite matching and stable marriages. Theoretical Computer 
Science, 127(2) :255-267, 1994. 

[20] Donald E. Knuth. Mariages Stables. Les Presses de I'Universite de Montreal, 
1976. 

[21] Fabian Kuhn. The Price of Locality: Exploring the Complexity of Distributed 
Coordination Primitives. PhD thesis, ETH Ziirich, 2005. 

[22] Fabian Kuhn, Thomas Moscibroda, and Roger Wattenhofer. The price of 
being near-sighted. In Proc. 17th Annual ACM-SIAM Symposium on Discrete 
Algorithms (SODA, Miami, FL, USA, January 2006), pages 980-989, New 
York, NY, USA, 2006. ACM Press. 

[23] Nathan Linial. Locality in distributed graph algorithms. SIAM Journal on 
Computing, 21(1):193-201, 1992. 

[24] Enyue Lu and S.Q. Zheng. A parallel iterative improvement stable match- 
ing algorithm. In Proc. 10th International Conference on High Performance 
Computing (HiPC, Hyderabad, India, December 2003), volume 2913 of Lecture 
Notes in Computer Science, pages 55-65, Berlin, Germany, 2003. Springer- 
Verlag. 

[25] Alain Mayer, Moni Naor, and Larry Stockmeryer. Local computations on 
static and dynamic graphs. In Proc. 3rd Israel Symposium on the Theory 
of Computing and Systems (ISTCS, Tel Aviv, Israel, January 1995), pages 
268-278, Piscataway, NJ, USA, 1995. IEEE. 

[26] Thomas Moscibroda. Locality, Scheduling, and Selfishness: Algorithmic Foun- 
dations of Highly Decentralized Networks. PhD thesis, ETH Ziirich, 2006. 

[27] Moni Naor and Larry Stockmeyer. What can be computed locally? SIAM 
Journal on Computing, 24(6): 1259-1277, 1995. 

[28] Huy N. Nguyen and Krzysztof Onak. Constant-time approximation algorithms 
via local improvements. In Proc. 49th Annual IEEE Symposium on Founda- 
tions of Computer Science (FOCS, Philadelphia, PA, USA, October 2008), 
pages 327-336, Los Alamitos, CA, USA, 2008. IEEE Computer Society Press. 

[29] Michal Parnas and Dana Ron. Approximating the minimum vertex cover in 
sublinear time and a connection to distributed algorithms. Theoretical Com- 
puter Science, 381(1-3):183-196, 2007. 

[30] Michael J. Quinn. A note on two parallel algorithms to solve the stable mar- 
riage problem. BIT Numerical Mathematics, 25(3):473-476, 1985. 

[31] Alvin E. Roth and John H. Vande Vate. Random paths to stability in two-sided 
matching. Econometrica, 58(6):1475-1480, 1990. 

[32] Jukka Suomela. Survey of local algorithms. Ihttp : //www. iki . f i/jukka 
|suomela/local-surveyl 2008. Manuscript in preparation. 

[33] S. S. Tseng. The average performance of a parallel stable marriage algorithm. 
BIT Numerical Mathematics, 29(3):448~456, 1989. 

[34] S. S. Tseng and R. C. T. Lee. A parallel algorithm to solve the stable marriage 
problem. BIT Numerical Mathematics, 24(3):308-316, 1984. 



19 



[35] Mirjam Wattenhofer and Roger Wattenhofer. Distributed weighted matching. 
In Proc. 18th International Symposium on Distributed Computing (DISC, Am- 
sterdam, Netherlands, October 2004), volume 3274 of Lecture Notes in Com- 
puter Science, pages 335-348, Berlin, Germany, 2004. Springer- Verlag. 

[36] Andreas Wiesc and Evangelos Kranakis. Loc;al maximal matc;hing and local 
2-approximation for vertex cover in UDGs. In Proc. 7th International Confer- 
ence on Ad-Hoc Networks & Wireless (AdHoc-NOW, Sophia Antipolis, France, 
Septemher 2008), volume 5198 of Lecture Notes in Computer Science, pages 
1-14, Berlin, Germany, 2008. Springer- Verlag. 



20 



